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1 Introduction 

We define a new grammar formalism called Link- 
Sharing Tree Adjoining Grammar (LSTAG) which 
arises directly out of a concern for distinguishing the 
notion of constituency from the notion of relating lex- 
cal items in terms of linguistic dependency^ iMcl'cuk 



1988| ; |Rambow and Joshi, 1992| ). This work de- 



rives directly from work on Tree Adjoining Gram- 
mars (TAG) ( Joshi, Levy, and Takahashi, 1975| ) where 
these two notions are conflated. The set of derived 
trees for a TAG correspond to the traditional notions 
of constituency while the derivation trees of a TAG 



are closely related to dependency structure ( Rambow 



and Joshi, 1992). A salient feature of TAG is the ex- 



tended domain of locality it provides for stating these 
dependencies. Each elementary tree can be associ- 
ated wi th a lexical item giving u s a lexicalized TAG 
(LTAG)( Joshi and Schabes, 1991 ). Properties related 



to the lexical item such as subcatcgorization, agree- 
ment, and certain types of word-order variation can 



be expressed directly in the elementary tree (Kroch 
1987|; |Frank, 1992]). Thus, in an LTAG all of these 



linguistic dependencies are expressed locally in the 
elementary trees of the grammar. This means that 



* Thanks to Christy Doran, Aravind Joshi, Nobo Komagata, 
Owen Rambow, and B. Srinivas for their helpful comments and 
discussion. 

1 The term dependency is used here broadly to include for- 
mal relationships such as case and agreement and other rela- 
tionships such as filler-gap. 



the predicate and its arguments are always topologi- 
cally situated in the same elementary tree. 

However, in coordination of predicates, e.g. 
the dependencies between predicate and argument 
cannot be represented in a TAG elementary tree di- 
rectly, since several elementary trees seem to be 'shar- 
ing' their arguments. 

(1) a. Kiki frolics, sings and plays all day. 

b. Kiki likes and Bill thinks Janet likes soccer. 



The idea behind LSTAG is that the non-local na- 
ture of coordination as in 



1) (for TAG-like gram- 



mar formalisms) can be captured by introducing a 
restricted degree of synchronized parallelism into the 
TAG rewriting system w hile retaining the existing 
independent parallelismp f Engclfriet, Rozcnbcrg, and| 



r_p; 

Blutzki, 1980| ; Rambow and Satta, to appear ). We 



believe that an approach towards coordination that 
explicitly distinguishes the dependencies from the 
constituency gives a better formal understanding of 
its representation when compared to previous ap- 
proaches that use tree-rewri ting systems which con- 



flate the two i ssues, as in (Joshi, 1990 ; Joshi and 



Schabes, 1991; Barkar and Joshi, 1996) which have 



2 It is important to note that while the adjunction opera- 
tion in TAGs is "context-free" , synchronized parallelism could 
be attributed to the TAG formalism due to the string wrap- 
ping capabilities of adjunction, since synchronized parallelism 
is concerned with how strings are derived in a rewriting sys- 
tem. We note this as a conjecture but will not attempt to 
prove it here. 



to represent sentences such as (1) with either un- 
rooted trees or by performing structure merging on 
the derived tree. Other formalisms for coordination 
have similar motivations: however their approaches 
differ, e.g. CCG ( |5teedman, 1985| ; fBteedman, 1997§ ) 
extends the notion of constituency, while generative 



syntacticians (Moltmann, 1992; Muadz, 1991) work 



with three-dimensional syntactic trees. 



2 Synchronized Parallelism 

The terms synchronized parallelism and independent 
parallelism arise from work done on a family of 
formalisms termed parallel rewriting systems that 
extend context-free grammars (CFG) by the addi- 



tion of various restrictive devices (see (Engelfriet 
Rozenberg, and Slutzki, 1980| ) )) . Synchronized par- 



allelism allows derivations which include substrings 
which have been generated by a common (or shared) 
underlying derivation process^. Independent paral- 
lelism corresponds to the instantiations of indepen- 
dent derivation processes which are then combined 
to give the entire derivation of a string^]. What we 
are exploring in this paper is an example of a mixed 
system with both independent and synchronous par- 
allelism. 



In ( Rambow and Satta, to appear ) it is shown 
that by allowing an unbounded degree of synchro- 
nized parallelism we get systems that are too uncon- 
strained. However, interesting subfamilies arise when 
the synchronous parallelism is bounded to a finite de- 
gree, i.e. only a bounded number of subderivations 
can be synchronized in a given grammar. The system 
we define has this property. 



3 LSTAG 

We first look at the formalism of Synchronous TAG 
(STAG)( f5hTeber and Schabes, 1990| ) since it is an ex- 



3 The Lindenmayer systems are examples of systems with 
only synchronous parallelism and it is interesting to note that 
these L systems have the anti-AFL property (where none of 
the standard closures apply). 

4 CFG is a formalism that only has independent parallelism. 



ample of a tree-rewriting system that has synchro- 
nized parallelism. 

As a preliminary we first informally define Tree 
Adjoining Grammars (TAG). For example, Figure [l] 
shows an example of a tree for a transitive verb 
cooked. Each node in the tree has a unique address 
obtained by applying a Gorn tree addressing scheme. 
For instance, the object NP has address 2.2. In 
the TAG formalism, trees can be composed using the 
two operations of substitution (corresponds to string 
concatenation) and adjunction (corresponds to string 
wrapping). A history of these operations on elemen- 
tary trees in the form of a derivation tree can be used 
to reconstruct the derivation of a string recognized 
by a TAG. Figure || shows an example of a deriva- 
tion tree and the corresponding parse tree for the de- 
rived structure obtained when a(John) and a(beans) 
substitute into a(cooked) and (3{dried) adjoins into 
a(beans) giving us a derivation tree for John cooked 
dried beans. Trees that adjoin are termed as auxiliary 
trees, trees that are not auxiliary are called initial. 
Each node in the derivation tree is the name of an 
elementary tree. The labels on the edges denote the 
address in the parent node where a substitution or 
adjunction has occured. 

Definition 1 In a TAG G — {7 | 7 is either an ini- 
tial tree or an auxiliary tree }, we will notate adjunc- 
tion (similarly substitution) of trees 71 . . . 7/c into tree 
7 at addresses ax ■ ■ . <Xfc giving a derived tree 7' as 

7' = 7[ai,7i] . . . [a fc ,7 fc ] 
Definition 2 Given two standard TAGs Gt, and 



Gr we define (from (Shieber, 1994)) a STAG as 
{(7>7 / >'~ % ) I 7 G Gl,7' G G r }, where ^ is a set 
of links from a node address in 7 to a node address 
in 7' . A derivation proceeds as follows: 

• for 7 = (7l,7_r,^), pick a link member ^ 
an, where the a's are node addresses and G 

For simplicity, we refer to as link and its 
elements ^ as link members. 

• adjunction (similarly substitution) of 
((3l, (3r, into 7 is given by 

<7L7W} = {lL[a L ^L],lR[a R ^R]^") 
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Figure 1 : Example of a TAG 
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Figure 2: Example of a derivation tree and corresponding parse tree 



where all links in ^ and are included in 
except <-^i. 

• (Tl> 7ri '~"") * s 71010 a derived structure which 
can be further operated upon. 



In flAbeillc, 199g ; |Abeille, 1994| ) STAGs have been 
used in handling non-local dependencies and to seper- 
ate syntactic attachment from semantic roles. How- 
ever, STAG cannot be used to seperate the dependen- 
cies created in (pairs of) derivation trees for coordi- 
nate structures from the constituency represented in 
these derivation trees. In this particular sense, STAG 
has the same shortcomings of a TAG. Also the above 
definition of the inheritance of links in derived struc- 
tures allows STAG to derive strings not generable by 
TAG ( |5hieber, 1994| ). We look at a modified ver- 
sion of STAGs which is weaker in power than STAGs 
as defined in Defn ||. We call this formalism Link- 
Sharing TAG (LSTAG). 



Definition 3 An LSTAG G is defined as a J^-tuple 
(Gl, Gr, A, $) where Gl,Gr are standard TAGs, A 
and <I> are disjoint sets of sets of links and for each 
pair 7 = {jl,"/r), where j L E G L and j R £ Gr, 
5-y E A is a subset of links in 7 and <f>~ lR E $ is a 
distinguished subset of links with the following prop- 
erties: 

• for each link <~-> E 4>-y R , fj ^Tj, where n is a node 
address in jr. i.e. 4> lR is a set of reflexive links. 

• 5r and 4> JR have some canonical order -<. 

• adjunction (similarly substitution) of ((3l,(3r) 
into 7 is given by 

(7z,>7fl> = {lL[a L , Ph]-,lR[aR, Pr]) 

and for all 7^ E <5 7 ,/?i E (f>p R (l < i < n) 
(card(S 7 ) > card((3R)) 



5 1 U . 



def 



Ti 



U...U 



7n 



3 



whe 



and 



is a set of links defined as follows. 



It is important to note that our initial motivation 
about seperating dependency from the constituency 
information is highlighted in (3 (see Fig. ^(b)) where 
the first projection will only contribute information 
about constituency in a derivation tree while the sec- 
ond projection will contribute only dependency infor- 
mation in a derivation tree. We conjecture that this 
is true for all the structures defined in an LSTAG. 



aR,i and Or" Or. , the 



If the kind of questions addressed in ( Rambow, Vijay- 
Shankcr, and Weir, 1995| ) can perhaps be answered 



U 



def r 



a R( } U {a Lz <-< a Rj } 



within the framework of LSTAGf 



• {i'liIr) * s ^ e new derived structure with new 
set of links <5 7 U <pp R . 

$ is used to derive synchronized parallelism in Gr. 
The ordering -i, is simply used to match up the links 
being shared via the (non-local) sharing operation U. 

This ordering -t, can be defined in terms of node 
addresses or "first argument -< second argument" , i.e. 
ordering the arguments of the two predicates being 
coordinated. 

It is important to note that only the links in $ are 
used non-locally and they are always exhausted in 
a single adjunction (or substitution) operation. No 
links from A are ever inherited unlike STAGs. Hence, 
non-locality is only used in a restricted fashion for 
the notion of 'sharing'. 

4 Linguistic Relevance 

To explain how the formalism works consider sen- 



tence |(2] 
(2) 



John cooks and eats beans. 



Consider a LSTAG G = {7, (3, a, v} partially shown 
in Fig. ||(a) and Fig. ||(b). a and v are analogously 
defined for John and beans respectively (see Fig. |l|). 
In Fig. |(a) <5 7 = {l,2]f]and <f> lR = {}, while for 
Fig. |(b) 5 1 = {} and ^> lR = {1,2}. 

5 We are just using numbers 1,2,... to denote the links 
rather than use the Gorn notation to make the trees easier to 
read. Here, link number 1 stands for 1 1 and 2 stands for 
2.2 ^ 2.2 
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Figure 3: Trees 7 and f3 from LSTAG G 



is 



The derived structure after (3 adjoins onto 7 
shown in Fig. 0(a). Fig. ||(a) shows the derived tree 
after the tree a (for John) substitutes into 7. Notice 
that due to link sharing, substitution is shared, effec- 
tively forming a "tangled" derived tree^. In Figs. |t] 



6 In ( Rambow, Vij ay- S hanker, and Weir, 1995| ) a new for- 
malism called D-Trce Grammars was introduced in order to 
bring together the notion of deri vation tree in a TAG with the 
notion of dependency grammar ( Mcl'cuk, 1988 1. Perhaps the 
dnd nf que stions addressed in (Kambow, Vijay-Shanker, and 
Weir, 1995 ) can also be handled using the current framework. 
Such an application ofUie formalism would motivate the need 
for trees like 7 in Fig. H independent of the coordination facts 
since they would be required to get the dependencies right. 

7 While this notion of sharing bears some resemblance to 
the notion nf joining nnrfp in the th ree-dimensional trees used 



ill dMoltmann, 1992| ; |Muadz, 199l| ) the rules for semantic in- 
terpretation of the derivations produced in a LSTAG is con- 
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and H the derivation trees are also given (associated 
with each element). The derivation structure for 
the second element in Fig. |5|(b) is a directed acyclic 
derivation graph which gives us information about 



dependency we expect. The derivation tree of the 
first clement in Fig. |5|(b), on the other hand, gives us 
information about constituency. 

The notion of link sharing is closely related to the 



1997b) shown below in combinatory notation. 



bxy = bxy 

bfg = Xx.b(fx)(gx) 

bfg = Xx.Xy.b(fxy)(gxy) 



Link sharing is used to combine the interpretation 
of the predicate arguments / and g (e.g. cooks, eats) 
of the conjunction b with the interpretation of the 
arguments of those predicates x, y, . . .. However, it 
does this within a tree-rewriting system, unlike the 
use of combinators in (Steedman, 1997b). 



5 Restrictions 

Having defined the formalism of LSTAG, we now de- 
fine certain restrictions on the grammar that can be 
written in this formalism in order to capture correctly 
certain facts about coordinate structures in English. 

For instance, we need to prohibit elementary struc- 
tures like the one in Fig. ^ b ecau se they give rise to 



ungrammatical sentences like (3) . 
(3) *Peanuts John likes and almonds hates. 



(Joshi, 1990) 



However, such restrictions in the context of TAGs 



have been discussed before. ( |Joshi, 1990|) rules out 
(3) by stating a requirement on the lexical string 
spelled out by the elementary tree. If the lexical 
string spelled out is not contiguous then it cannot 

siderably less obscure than the rules needed to interpret 3D 
trees; crucially because elementary structures in a TAG-likc 
formalisms are taken to be semantically minimal without be- 
ing semantically void. 



coordinate. This requirement is stated to be a phono- 
logical condition and relates the notion of an intona 
tional phrase (IP) to the notion of appropriate frag- 
ments for coordination (in the spirit of ( (Btcedman 



1997a)). It is important to note that the notions 



of phrase structure for coordination and intonational 



phrases defined in (Joshi, 1990) for TAG ar e not iden- 



-ichc-matization of the coordination rule in (|Steedman, 1997a ) 



tical, whereas they are identical for CCG ( Bteedman 



We can state an analogous restriction on the for- 
mation of elementary structures in a LSTAG, one 
that is motivated by the notion of link sharing. The 
left element of an elementary structure in a LSTAG 
cannot be composed of discontinuous parts of the 
right element. For example, in Fig. |^ the segment 
[s[jvpJ[vp]] from the right element has been excised 
in the left element. This restriction corresponds to 
the notion that the left element of a structure in a 
LSTAG represents constituency. 



6 Conclusion 



We have presented a new tree-rewriting formal- 
ism called Link-Sharing Tree Adjoining Grammar 
(LSTAG) which is a variant of synchronous TAGs 
(STAG). Using LSTAG we defined an approach to- 
wards coordination where linguistic dependency is 
distinguished from the notion of constituency. Ap- 
propriate restrictions on the nature of elementary 
structures in a LSTAG were also defined. Such an 
approach towards coordination that explicitly distin- 
guishes dependencies from constituency gives a better 
formal understanding of its representation when com- 
pared to previous approaches that use tree-rewriting 
systems which conflate the two issues (see ( Joshi and 
schabes, 1991; Sarkar and Joshi, 1996)). The pre- 



vious approaches had to represent coordinate struc- 
tures either with unrooted trees or by performing 
structure merging on the parse tree. Moreover, the 



linguistic analyses presented in ( Joshi and Schabcs 
1991; Sarkar and Joshi, 1996| ) can be easily adopted 



in the current formalism. 
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